Smoothing parameter estimation framework for IBM word alignment models

نویسندگان

  • Vuong Van Bui
  • Cuong Anh Le
چکیده

IBM models are important word alignment models in Machine Translation. Based on the Maximum Likelihood Estimation principle to estimate their parameters, the models could easily overfit training data when data are sparse. Even though smoothing is a very popular solution in Language Model, there is still a lack of studies on smoothing for word alignment. In this paper, we propose a framework which generalizes the notable work Moore (2004) of applying additive smoothing to word alignment models. The framework allows developers to customize the smoothing amount for each pair of words. The added amount will be scaled appropriately by a common factor which reflects how much the framework trusts the adding strategy according to the performance on data. We also carefully examine various performance criteria and propose a smoothened version of the error count, which generally gives the best result.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Improving IBM Word Alignment Model 1

We investigate a number of simple methods for improving the word-alignment accuracy of IBM Model 1. We demonstrate reduction in alignment error rate of approximately 30% resulting from (1) giving extra weight to the probability of alignment to the null word, (2) smoothing probability estimates for rare words, and (3) using a simple heuristic estimation method to initialize, or replace, EM train...

متن کامل

A Fast Fertility Hidden Markov Model for Word Alignment Using MCMC

A word in one language can be translated to zero, one, or several words in other languages. Using word fertility features has been shown to be useful in building word alignment models for statistical machine translation. We built a fertility hidden Markov model by adding fertility to the hidden Markov model. This model not only achieves lower alignment error rate than the hidden Markov model, b...

متن کامل

11-731 Class Project Report Estimating Better Position Alignments for Statistical Machine Translation

In Statistical Machine Translation, we often find forward or backword jumps while translating from a source position to a target position. We propose several position alignment models for estimating these jump probabilities. Our initial jump probability model is a coarse model with no dependencies. The maximum likelihood estimation of the jump probabilities is performed during post word alignme...

متن کامل

A Java Implementation of an Extended Word Alignment Algorithm Based on the IBM Models

In recent years statistical word alignment models have been widely used for various Natural Language Processing (NLP) problems. In this paper we describe a platform independent and object oriented implementation (in Java) of a word alignment algorithm. This algorithm is based on the first three IBM models. This is an ongoing work in which we are trying to explore the possible enhancements to th...

متن کامل

It Depends on the Translation: Unsupervised Dependency Parsing via Word Alignment

We reveal a previously unnoticed connection between dependency parsing and statistical machine translation (SMT), by formulating the dependency parsing task as a problem of word alignment. Furthermore, we show that two well known models for these respective tasks (DMV and the IBM models) share common modeling assumptions. This motivates us to develop an alignment-based framework for unsupervise...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2016